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Abstract 

For the analysis of microbial community structure based on 16S rDNA sequence diversity, sensitive and robust PGR 
amplification of 16S rDNA is a critical step. To obtain accurate microbial composition data, PGR amplification must be free of 
bias; however, amplifying all 16S rDNA species with equal efficiency from a sample containing a large variety of 
microorganisms remains challenging. Here, we designed a universal primer based on the V3-V4 hypervariable region of 
prokaryotic 16S rDNA for the simultaneous detection of Bacteria and Archaea in fecal samples from crossbred pigs 
(LandracexLarge white xDuroc) using an lllumina MiSeq next-generation sequencer. In-silico analysis showed that the 
newly designed universal prokaryotic primers matched approximately 98.0% of Bacteria and 94.6% of Archaea rRNA gene 
sequences in the Ribosomal Database Project database. For each sequencing reaction performed with the prokaryotic 
universal primer, an average of 69,330 (±20,482) reads were obtained, of which archaeal rRNA genes comprised 
approximately 1.2% to 3.2% of all prokaryotic reads. In addition, the detection frequency of Bacteria belonging to the 
phylum Verrucomicrobia, including members of the classes Verrucomlcrobiae and Opitutae, was higher in the NGS analysis 
using the prokaryotic universal primer than that performed with the bacterial universal primer. Importantly, this new 
prokaryotic universal primer set had markedly lower bias than that of most previously designed universal primers. Our 
findings demonstrate that the prokaryotic universal primer set designed in the present study will permit the simultaneous 
detection of Bacteria and Archaea, and will therefore allow for a more comprehensive understanding of microbial 
community structures in environmental samples. 
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Introduction 

The analysis of 16S rRNA (16S rDNA) provides valuable 
phylogenetic information for the comparison of microbial diversity 
in environmental samples. A number of molecular biological 
techniques based on 16S rDNA gene sequence diversity have been 
developed for investigating microbial community structure. 
Among these techniques, fingerprinting methods, such as dena- 
turing gradient gel electrophoresis (DGGE) [1-3] and terminal- 
restriction fragment length polymorphism (T-RFLP) [4—6], are 
frequently used due to their ease of use and relatively low cost [7]. 
The construction of 16S rDNA clone libraries is also extensively 
used to study microbial community structure [2,4,8]. However, as 
these culture-independent approaches are estimated to detect only 
0.1%~1% of the total microbial population, they are not optimal 
for the analysis of subdominant microbial species or groups [9, 1 0] . 

In recent years, the development of next-generation sequencing 
(NGS) technologies has permitted in-depth sequencing and data 
analyses of various types of environmental samples [11-15] at a 
deeper level than possible with standard molecular biological 
techniques [16]. In particular, lUumina-based strategies, which 



provide paired reads of the same DNA fragment, offer multiplex- 
ing capability and generate large amounts of sequence data 
[16,17]. Of the commercially available lllumina platforms, the 
MiSeq sequencer has the greatest potential for 16S rDNA 
sequence studies, because it generates sequence reads of up to 
600 bp and has a performance to cost ratio that is manageable for 
average-sized research laboratories [18]. The generation of longer 
sequence reads is an important feature of this platform, as they are 
easier to assign to taxonomic groups [19]. In addition, because the 
MiSeq lllumina platform allows for deeper sequencing than 
conventional approaches, it allows for the detection and analysis of 
subdominant microbial species or groups [1 1] . Therefore, the 
combination of this NGS technology with robust prokaryotic 
universal primer sets will allow for the more accurate determina- 
tion of the Bacter ia to Archaea ratio than conventional approach- 
es. Several studies have successfully applied prokaryotic universal 
primer sets and NGS platforms for microbial community structure 
analysis of environmental and clinical samples, including Artie sea 
ice, soil, and urinary catheters [20-22]. However, the phylogenetic 
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specificity and degree of bias of these prokaryotic universal primers 
remain unclear. 

In the present study, we designed universal primer sets for the 
specific detection of the V3-V4 regions of prokaryotic 16S rDNA, 
including the domains Bacteria and Archaea. In addition, we 
dc\clopc'd a simultaneous analysis system for Bacteria and Archaea 
based on lUumina MiSeq next-generation sequencing technology. 

Materials and Methods 

Fecal samples 

Fecal samples were collected from three 1 1 -month-old cross- 
bred pigs (Landrace xLarge white xDuroc), which were housed on 
a farm maintained by Kyoto Prefectural University. The breeding 
of pigs adhered to the Guidelines Concerning the Care and Use of 
Laboratory Animals of the Animal Experiment Committee of 
Kyoto Prefectural University. Because the feces were collected 
non-invasi\'cly, a permit number was not required for the research. 
Feces samples were kindly collected by Dr. K. Ushida and Dr. S. 
Tsuchida (Kyoto Prefectural University) and were then transport- 
ed to our laboratory. The feces from individual pigs were divided 
into 5-g samples and then frozen at — 80°C until use. 

DNA extraction 

Frozen fecal samples were thawed on ice, 100 mg of each 
sample was suspended in 4 M guanidium thiocyanate, 100 mM 
Tris-HCl (pH 9.0), and 40 mM EDTA, and die samples were 
then beaten with zirconia beads using a FastPrep FPIOOA 
instrument (MP Biomedicals, USA). DNA was extracted from 
the bead-treated suspensions using a Magtration System 12GC 
and GC series MagDEA DNA 200 (Precision System Science, 
Japan). DNA concentrations were estimated by spectrophotometry 
using an ND-1000 instrument (NanDrop Technologies, USA), and 
the final concentration of the DNA sample was adjusted to 10 ng/ 
(i,L. 

Development of a prokaryotic universal primer set 

Previously published universal PCR primer sets targeting the 
V3-V4 region of bacterial and archaeal rDNA were modified to 
increase the detection rate of prokar^-otes [23,24]. Multiple 
alignments of 16S rDNA sequences obtained from the DDBJ/ 
GenBank/EMBL database for selected reference organisms were 
performed using the Clustal W program with default settings [25]. 
Based on the alignment results, primer sequences that gave the 
fewest mismatches in the V3-V4 h)rpervariable region of the 16S 
rRNA gene were selected as prokaryotic universal primers. The 
coverage of the previous and present primc-r sets was checked 
against good quality sequences (as defined based on Pintail scores 
by the RDP database) of greater than 1,200 bp and Si 
mismatches [26] in the RDP database (Release 10) using the 
feature Probe Match program [27]. The primer sequences and 
associated PCR assays used in this study are fisted in Table 1. 

Illumina library generation 

The V3-V4 region of 16S rDNA was amplified using Pro341F/ 
Pro805R for Prokaryoles, 341F/R806 for Bacteria, and 
ARC344F/Arch806R for Archaea primer sets (Table 1). In 
addition to the V3-V4 specific priming regions, these primers 
were complementary to standard Illumina forward and reverse 
primers. The reverse primer also contained a 6-bp indexing 
sequence (Table 1) to allow for multiplexing. Amplification 
primers were designed with Illumina adapters. 

To reduce the formation of spurious by-products during the 
amplification process, the touchdown PCR method for thermal 



cycUng was used with a Rotor-Gene Q, quantitative thermal cycler 
(Qiagen, Germany) [28] . The reaction mixture (25 H-L) contained 
10 ng genomic DNA, MightyAmp for Real Time (SYBR Plus) 
(Takara, Japan), and 0.25 |iM of each primer. The PCR reaction 
conditions for amplification of DNA were as follows: initial 
denaturation at 98°C for 2 min, followed by 35 cycles of annealing 
beginning at 65°C and ending at 55°C for 15 sec, and extension at 
68°C for 30 sec. The annealing temperature was lowered 1°C 
every cycle until reaching 55°C, which was used for the remaining 
cycles. PCR products were purified through a MultiScreen 
PCRu9B filter plate (Merck Millipore, USA) and analyzed using 
a Bioanalyzer DNA 1000 Chip Kit (Agilent Technologies, USA) to 
detect j)rimcT-dimers and determine the average molecular weight 
of each product. The purified products were quantified by real- 
time quantitative PCR (q-PCR) on a Rotor-Gene quantitative 
thermal cycler using MightyAmp for Real Time (SYBR Plus), 
0.2 |xM of each primer, which were derived from Illumina 
adapters (Table 1), and serially diluted PhiX control library 
(Illumina, USA) as a standard. The PCR reaction conditions for 
quantification of each PCR product were as following: initial 
denaturation at 98°C for 2 min, followed by 30 cycles of 
denaturation at 98°C for 10 sec, annealing at 60°C for 15 sec, 
and extension at 68°C for 30 sec. The quantification step was used 
to determine the concentration of the amplified libraries and to 
confirm the presence of suitable primers for Illumina sequencing. 

Illumina sequencing and quality filtering 

Each multiplexed library pool was spiked with 25% phiX 
control to improve base calling during sequencing, as recom- 
mended by lUumina for the pooling of two libraries. Sequencing 
was conducted using a paired-end, 2x250-bp cycle run on an 
lUumina MiSeq sequencing system and MiSeq Reagent Nano Kit 
version 2 (500 C)x-Ic"! chemistry. After sequencing was complete, 
image analysis, base* calling, and error estimation were performed 
using Illumina Real-Time Analysis software (version 1.17.28). 

Paired-end sequencing with read lengths of 251 bp was 
performed. After demultiplexing, a clear overlap in the paired- 
end reads was observed. This allowed paired reads to be joined 
together with the fastq-join program (http://code.google.eom/p/ 
ea-utils/). Only reads that that had quality value (QV) scores of ^ 
20 for more than 99% of the sequence were extracted for further 
analysis. All sequences with aml)iguous base- calls wc-r(- discarded. 
The nucleotide sequence dataset was deposited in the Sequence 
Read Archive of the DNA Data Bank of Japan (DDBJ) under the 
accession number DRA002295. 

16S rDNA-based taxonomic analysis 

Analyses of sequence reads were performed manually using the 
Ribosomal Database Project (RDP) Multiclassifier tool [19], which 
is available from the RDP website (http://rdp.cme.msu.edu/ 
classifier/). Reads obtained in the FASTA format were assigned to 
class levels with an 80% confidence threshold. 

Real-time quantitative PCR 

Quantitation of all Bacteria, Archaea, class Thermoplasmata, 
and class Methanobacteria in the pig intestinal tract was performed 
using real-time quantitative PCR (qPCR). To obtain standards for 
the qPCR, purified genomic DNA from Escherichia coli JCM 
1 649 ' and Melhanosarcina acelivorans DSM 2834''' was amplified 
using primer pairs 8F and 1510R or 2 IF and 1510R, respectively 
(Table 1). Extracted fecal DNA was amplified using primer pairs 
Pig Thermo F and Pig Thermo R or Pig Methano F and Pig 
Methano R (Table 1). The PCR reaction mixture (25 ^L) 
contained 20 ng genomic DNA, 2xMightyAmp Buffer Ver.2 
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Table 1. Primers and thermal cycling profiles used in this study. 



Tdrcfst 


Primer ndme 


Oligonucleotide sequence (5'— 3') 


Experiment 


Reference 


Prokaryote 16S rRNA 


rroi41 r 


A AT/" AXA/"/~/~/~/~A/"/"A/"/"/~A/~ AT/ — T A l~ A 1 — T/ — | | | /-/— / — r A /~A/~/~A/~/~/~ 

AA Itj A 1 ALLiCjLL] ALL ALLUACj A 1 L 1 ALAL 1 L 1 1 1 LLL 1 ALALLj ALLiL- 
TCTTCCGATCTCCTACGGGAGGCAGCAGCCTACGGGNBGCASCAG 


rLK tor NGb 


Moaitiea lakai izi\ 




Pro805R 


CAAGCAGAAG ACGGCATACGAG ATN N N N NNGTGACTGG AGTTCA- 
GACGTGTGCTCTTCCGATCTGACTACNVGGGTATCTAATCC 




Modified Herlemann [24] 


Bacteria 16S rRNA 


341 F 


AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGACGCTC- 
TTCCGATCTCCTACGGGAGGCAGCAGCCTACGGGAGGCAGCAG 


PGR for NGS 


Muyzer [44] 




R806 


CAAGCAGAAG ACGGCATACGAG ATN N N N NNGTGACTGGAGTTC- 
AGACGTGTGCTCTTCCGATCTGGACTACHVGGGTWTCTAAT 




Caporaso [45] 




341 F 


CCTACGGGAGGCAGCAG 


q-PCR 


Muyzer [44] 




R806 


GGACTACHVGGGTWTCTAAT 




Caporaso [45] 




8F 


AGAGTTTGATCCTGGCTCAG 


PGR 


Edwards [46] 




1510R 


GGTTACCTTGTTACGACTT 




Reysenbacfi [47] 


Archaea 16S rRNA 


ARC344F 


AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGACGCTCT- 
TCCGATCTCCTACGGGAGGCAGCAGACGGGGYGCAGCAGGCGCGA 


PGR for NGS 


Raskin [48] 




Arch806R 


CAAGCAGAAG ACGGCATACGAG ATN N N N NNGTGACTGGAGTTCA- 
GACGTGTGCTCTTCCGATCTGGACTACVSGGGTATCTAAT 




Takai [23] 




ARC344F 


ACGGGGYGCAGCAGGCGCGA 


q-PCR 


Raskin [48] 




Arch806R 


GGACTACVSGGGTATCTAAT 




Takai [23] 




21 F 


TTCCGGTTGATCCYGCCGGA 


PGR 


DeLong [49] 




1510R 


GGTTACCTTGTTACGACTT 




Reysenbacfi [47] 


lllutmina adapters 
{sequencing) 


Miseq_F 


AATGATACGGCGACCACCGAGAT 


qPCR 


Tfiis study 




Miseq_R 


CAAGCAGAAGACGGCATACGAGAT 






Methanobacteria 


Pig IVlethiano F 


TCCGCAATGTGAGAAATCGC 


qPCR 


Tfiis study 




Pig Metiiano R 


TACCCTGGGAGTACCTCTAACCTCT 






Thermoplasmata 


Pig Tiiermo F 


GAGGGAATTCCTAGTGCTAGGACA 


qPCR 


Tfiis study 




Pig Tiiermo R 


ATCAAACCGGCTACGAACGTT 







Underlined regions indicate lllumina adapter sequences. Bold-face font indicates PCR primer region, which is preceded by a linker sequence. Poly-N string in forward 
primer denotes barcode sequence. 

Barcode sequences used in this study (5'-3'): GATCTG, TCAAGT, CTGATC, AAGCTA, GTAGCC, TACAAG, CGTGAT, ACATCG, and GCCTAA. 
doi:10.1371/journal.pone.0105592.t001 



(Takara), 0.25 |J,M of each primer, and 1.25 units of MightyAmp 
DNA Polymerase (Takara). The cycling conditions were as follows: 
initial denaturation at 98°C for 2 min, followed by 35 cycles of 
98°C for 10 s, 55°C for 15 s, and 68°C for 1 min. The 
amplification products (3 |a.L) were mixed with 1 |j.L EZ-Vision 
One DNA Dye (Amresco Inc., USA) and then separated by 
electrophoresis on 2% agarose gels to confirm the production of a 
single product of the expected molecular weight. The PCR 
products were purified using Econo Spin Ila (Gene Design, Japan) 
and then cloned into pGEM-T Easy vector (Promega, USA) for 
the transformation of E. coli HST08 Premium Competent Cells 
(Takara). Positive transformants were selected on LB agar 
supplemented with ampiciUin (100 Hg/ml), and a single colony 
that was verified by PCR to contain a plasmid with the expected 
insert DNA was grown in 5 ml LB medium supplemented with 
ampicHlin (100 |tg/ml) overnight. The culture was centrifuged at 
7,610 xg' to pellet the cells, and plasmid DNA was then extracted 
from cells using a QIAprep Spin miniprep kit according to the 
manufacturer's instructions (Qiagen). The purified plasmid was 
quantified using a ND-1000 instrument. The number of 16S 
rDNA copies present in the plasmid preparation was estimated 
based on the DNA concentration and molecular mass of pGEM-T 
Easy with the target insert, as previously described [29]. For the 



calculation, the expected weight in Daltons (g/mol) of the plasmid 
construct was first determined using the equation: (length of the 
double-stranded product in base pairs [bp]) * (330 Dax2 
nucleotides/bp). The obtained value was then divided by 
Avogadro's number (6.022x10^'^) to give the number of grams 
per molecule, which was then divided by the total amount of the 
plasmid preparation to give the total number of copies present in 
the sample. The purified plasmid DNA solution was serially 
diluted 10-fold to give solutions ranging from 10' to 10 copies/ 
|j.L. The serially diluted samples were used to generate a standard 
curve that was used to estimate the copy number of the target 
group for each qPCR reaction. 

Q;PCR was performed on a Rotor-Gene Q, quantitative 
thermal cycler using SYBR Premix Ex Taq II (Tli RNaseH Plus) 
(Takara). Each reaction mixture (20 |J.L) contained 20 ng extract- 
ed DNA and 0.2 |J.M of each primer. The cychng conditions were 
as follows: initial denaturation at 95°C for 30 sec, followed by 35 
cycles of 95°C for 5 sec, 60°C (all Bacteria and Archaea), 58°C 
(Thermoplasmata) or 55°C [Methanobacteria) for 20 sec, and 72°C 
for 20 sec. For each reaction, the positive control and negative 
water control were assayed together with the samples. The melting 
curves of the amplified DNA were generated to verify the 
specificity of the reaction. The ratio of the total archaeal 
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Table 2. In-sllico analysis for the coverage rate of predicted primer sets. 



Primer set 


Target 


Hits/queries (£1 mismatch) 


Coverage rate* 


Pro341F/Pro805R 


Bacteria 


1230518/1255860 


98.0% 




Archaea 


22318/23588 


94.6% 


341 F/R806 


Bacteria 


1223304/1255860 


97.4% 


AKCJ44r/ArcnoUDK 


Archaea 


14966/23588 


63.4% 


Uni340F/Uni806R [23] 


Bacteria 


1224925/1255860 


97.5% 




Archaea 


22319/23588 


94.6% 


PRK341F/PRK806R [40] 


Bacteria 


1223340/1255860 


97.4% 




Archaea 


22132/23588 


93.8% 


Primers used in the present and previous studies 

edu/) to evaluate specificity. 

*The coverage of eacfi primer set was calculated 


were submitted to the 'Probe Match program' facility of the 
from the total number of matching sequences. 


Ribosomal Database Project {http://www.rdp.cme.msu. 



doi:l 0.1 371 /journal.pone.Ol 05592.t002 



population to that of prokaryotes was estimated from total 
bacterial and archaeal copy numbers using the following equation: 

A/(A + B):B/(A + B) (1) 

where A is the total copy number oi Bacteria and B is the total 
copy number oi Archaea. Similarly, the ratio ofThermoplasrmta to 
Methanobacteria was also calculated using equation (1), with A 
representing the total copy number of Thermoplasmata and B 
representing the total copy number oi Methanobacteria. 



Results 

Development and coverage of the prokaryotic universal 
primer 

We modified the universal primers Pro341F and Pro805R, 
which target the V3-V4 hypervariable region of 16S rDNA of 
both Bacteria and Archaea, to improve the coverage of existing 
sequences in the RDP database. The results of in-silico analysis for 
primer specificity against 16S rDNA sequences of the newly 
designed primer, and the domain Bacteria- and ^rc/iaea-specific 
universal primers are presented in Table 2. The new prokaryotic 
universal primer matched approximately 98.0% of Bacteria and 
94.6% of Archaea rRNA gene sequences present in the RDP 



100 



80 



O 
O 

c 

60 

■a 
c 



< 
> 



40 



20 



■ II 



■other 

■ Unclassified Bacteria 
Methanobacteria 
Gammaproteobacteria 
Unclassified Bacteroidetes 

' Bacteroidia 

■ Unclassified Firmicutes 
' Erysipelotrichia 

Negativicutes 

■ Bacilli 

' Clostridia 



Pigl 



Pig2 



Pig3 



Figure 1. Class-level NGS analysis results for microbial diversity in pig fecal samples using the prokaryotic universal primer. The bar 

charts show the taxonomic profiles obtained for each of the three pig fecal samples. The arrowheads indicate archaeal reads {Methanobacteria). 
doi:1 0.1 371/journal.pone.01 05592.g001 
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100 




■ Archaea 

■ Bacteria 



Figure 2. Relative abundance ratios of Bacteria and Archaea estimated by NGS witKi tlie prokaryotic universal primer. The results of 
NGS analysis for the three pig fecal samples are compared to those obtained by real-time quantitative PCR (qPCR). 
doi:1 0.1 371 /journal.pone.01 05592.g002 



database (release 10). The match percentages of the previously 
reported Bacteria (341F/R806) and Archaea (ARC344F/ 
Arch806R) domain-specific primers were 97.4% and 63.4%, 
respectively. Thus, the coverage of Archaea sequences by the 
newly designed prokaryotic universal primer was markedly higher 
than that of the archaeal specific primer. 

NGS analysis using the prokaryotic universal primer 

To determine whether Bacteria and Archaea could be 
simultaneously detected using the newly designed prokaryotic 
universal primer, NGS analysis was performed for pig fecal 
samples. Using this primer and MiSeq platform combination, an 
average of 69,330 (±20,482) reads were obtained for each 
sequencing reaction. The reads were then analyzed at the class 
level using the RDP Multiclassifier tool (Fig. 1). The NGS analysis 
indicated that members of the class Clostridia were the most 
dominant taxonomic group in all samples. As shown in Fig. 1, 
reads corresponding to archaeal rRNA genes comprised approx- 
imately 1.2% to 3.2% of prokaryotic reads for all samples analyzed 
using the prokaryotic universal primer. 

Comparison of the prokaryotic abundance ratio 
determined by NGS and q-PCR 

We next analyzed whether the detection rate of Archaea was 
influenced by primer bias by calculating the relative abundance of 
Bacteria versus Archaea sequences using q-PCR. The abundance 
ratio of Archaea was approximately 1.1% to 2.6% of all 
prokaryotes, and the abundance ratios of Bacteria and Archaea 
were consistent with the q-PCR results (Fig. 2). From this result, it 
was determined that the prokaryotic universal primer did not 
display strong primer bias and was able to accurately reflect the 
relative abundance oi Bacteria and Archaea. 



PCR bias of the prokaryotic universal primer for the 
domain Bacteria 

To evaluate the effect of primer bias on the abundance 
estimation of bacterial taxonomic groups, the NGS analysis results 
obtained using the bacterial and newly developed prokaryotic 
universal primers were compared. The estimated abundance ratios 
of the bacterial taxonomic groups based on the sequences obtained 
with either the prokaryotic or bacterial universal primer were in 
agreement for nearly all of the dominant bacterial taxonomic 
groups (abundance-ratio >0.001%) (Fig. 3). However, archaeal 
sequences were also detected in the analyses performed with the 
bacterial universal primer, indicating that primer bias existed, 
although the detection frequency of Archaea was reduced 
compared to that obtained with the prokaryotic universal primer. 
In addition, the detection frequency of Bacteria belonging to the 
phylum Verrucomicrohia, including members of the classes 
Verrucomicrohiae and Opitutae, was higher in the NGS analysis 
using the prokaryotic universal primer than that performed with 
the bacterial universal primer (Fig. 4). 

PCR bias of the prokaryotic universal primer for the 
domain Archaea by NGS analysis and q-PCR 

To evaluate the effect of primer bias on the abundance 
estimation of the two major archaeal taxonomic groups (classes 
Thermoplasmata and Methanobacteria) detected in the NGS 
analysis, the results obtained using the archaeal and prokaryotic 
universal primers were compared. The detection frequency of 
members of the class Methanobacteria was higher in the NGS 
analysis using the prokaryotic universal primer than that in the 
analysis performed with the archaeal universal primer (Fig. 5). 

Class-specific primer sets for q-PCR analysis were also designed 
to estimate and compare the relative abundance ratios of the 
classes Thermoplasmata and Methanobacteria with the NGS 
analysis. The abundance ratios of these two classes in Archaea 
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M5 




Detection rate of each bacterial group 
(prol<aryotic universal primer) Log10 (%) 



Figure 3. PCR bias of the prol<aryotic universal primer for members of tlie domain Bacteria (Class-level NGS analysis). 

doi:10.1371/journal.pone.0105592.g003 

were similar for the NGS analysis using the prokaryotic primer higher using the prokaryotic universal primer in the first fecal 
and the q-PCR analysis using the class-specific primer sets (Fig. 5), samples (Fig. 5, Pigl). In contrast, the estimated abundance ratios 
although the relative abundance oi Methanobacteria was markedly of the classes Thermoplasmata and Methanohacteria by q-PCR 

1.2 I 




Opitutae Verrucomicrobiae 
■ Prokaryotic universal primer ■ Bacterial universal primer 

Figure 4. Relative abundance ratios of members of the classes Verrucomicrobiae and Opitutae for NGS analyses using the prokaryotic 
and bacterial universal primers. 

doi:1 0.1 371/journal.pone.01 05592.g004 
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■ Methanobacteria ■ Thermoplasmata 



Figure 5. Relative abundance ratios of tKie classes Thermoplasmata and Methanobacteria estimated by NGS with the prokaryotic 
universal primer (pro) and archaeal universal primer (arc). 

doi:10.1371/journal.pone.0105592.g005 



markedly differed when compared to the NGS analysis performed 
with the archaeal universal primer. Specifically, the ratio of 
Methanobacteria in the analysis with the archaeal universal primer 
was lower than that estimated using class-specific primers in the q- 
PCR analysis. 

Discussion 

In the present study, we designed and examined the utility of a 
prokaryotic universal primer for the simultaneous amplification of 
bacterial and archaeal 16S rDNA using an lUumina MiSeq next- 
generation sequencer. We found that the newly developed primer 
set matches 98.0% oi Bacteria and 94.6% oi Archaea rRNA gene 
sequences in the RDP database and allows for the simultaneous 
analysis oi Bacteria and Archaea in pig fecal samples. In addition, 
the prokaryotic universal primer has less bias than previously 
described universal primers, a characteristic that is critical for 
obtaining accurate microbial composition data. Thus, the newly 
designed prokaryotic universal primer set is expected to permit the 
simultaneous detection of Bacteria and Archaea in environmental 
samples, and wiU therefore improve the cost and efficiency of 
microbial community structure analysis using NGS technology. 

In the NGS analyses with the newly designed prokaryotic 
universal primer, aU identified taxonomic groups were also 
detected in the sequencing analysis with the bacterial universal 
primer; however, the detection frequency of several taxonomic 
groups markedly differed between the two primer sets. Notably, 
bacteria belonging to the classes Verrucomicrobiae and Opitutae 
were detected in the fecal samples at a nearly five-fold higher 
frequency with the prokaryotic universal primer. The prokaryotic 
universal primer was designed based on a universal primer 
(Uni340F, 5'-CCTACGGGRBGCASCAG-3') described by Ta- 
kai and Horikoshi [23]. However, we speculated that members of 



Verrucomicrobiae and Opitutae, which are increasingly being 
recognized as abundant species in various environments, were 
difificult to detect using primer 34 IF due to a single base mismatch 
with the 16S rRNA gene at the 9th base from the 5' end of the 
bacterial 34 IF primer. Thus, we modified primer Uni340F to 
remove the single mismatch (5'-CCTACGGGNBGCASCAG') 
and confirmed that the relative detection rate of Verrucomicrobiae 
and Opitutae with the modified prokaryotic universal primer 
increased five-fold compared to the bacterial universal primer. 
Several studies have also demonstrated that even a single internal 
nucleotide mismatch in the V3-V4 target region of the primer can 
drastically alter PGR bias [_ENREF_ 1430-32]. For example, a 
comparison of two primer sets targeting the V3-V4 region of the 
16S rRNA gene showed that vastly different fractions of the 
bacterial community in activated sludge were detected [31,33]. In 
addition, differences in the nucleotide sequence of the primer and 
the V3 region strongly affected the microbial community structure 
[14]. We also found that the prokaryotic universal primer set gave 
different abundance ratios for the classes Thermoplasmata and 
Methanobacteria compared to the archaeal universal primer in the 
NGS analysis of the three pig fecal samples. The results of q-PCR 
with specific primers for these two classes suggested that the 
prokaryotic universal primer more accurately estimated the 
abundance ratios than the archaeal universal primer. To confirm 
these findings, however, the analysis of more environmental 
samples using both primers sets is needed. 

NGS analysis with the prokaryotic universal primer indicated 
that members of the class Clostridia were the most dominant 
taxonomic group in all fecal samples. This finding is consistent 
with a previous analysis of pig intestinal microflora based on 16S 
rDNA that found Clostridiales was the dominant bacterial group 
[34-36]. In addition, we found that archaeal rRNA genes 
comprised approximately 1.2% to 3.2% of all prokaryotic reads 
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in the analysis with the prokaryotic universal primer. Increasing 
evidence suggests that archaea are widely distributed in the 
environment [37,38], and it is well-known that archaea are part of 
the intestinal microbiota in humans [38], although a comprehen- 
sive analysis of the archaeal community structure has yet to be 
performed. However, to our knowledge, this is the first report of 
the relative abundance ratio of Archaea in the pig intestinal tract 
using an NGS approach. Several studies have examined microbial 
diversity in pigs using DGGE and 16S rDNA clone libraries 
[36,39]. For example, a relatively recent study examined the 
microbial composition of aerosols in a swine confinement facility 
and found that all detected Archaea sequences belonged to 
methanogenic archaea, which is consistent with our present 
finding of abundant Methanobacteria 16S rRNA in pig fecal 
samples [39]. 

For the accurate determination of microbial community 
structure based on 16S rDNA sequence diversity, sensitive, robust, 
and unbiased PGR amplification of 16S rDNA is critical. In 
particular, a high match percentage of primers is critical to avoid 
bias in cultivation-independent investigations of microbial com- 
munity structure. The present in-silico analysis revealed that the 
coverage rates of the newly designed universal prokaryotic primer 
for sequences in the RDP database were approximately 0.5% 
higher for Bacteria and 0.8% higher for Archaea than those of 
previously described universal primers [23,40]. We also demon- 
strated that the universal prokaryotic primer set designed here has 
markedly lower bias than that of most previously applied universal 
primers [23,40] . It was previously suggested that the presence of a 
single internal primer-template mismatch in the primer extension 
sequence can result in a 1,000-fold underestimation of gene copy 
number, leading to an underrepresentation of the taxonomic 
groups that do not match the universal primer sequence [30] . As 
described above regarding the increased detection rate of 
Verrucomicrobiae and Opitutae, our universal prokaryotic primer 
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